File: cmu-user.info Node: Precise Type Checking-Footnotes, Up: Precise Type Checking
(3) There are a few circumstances where a type declaration is discarded
rather than being used as type assertion. This doesn't affect safety
much, since such discarded declarations are also not believed to be true
by the compiler.
(4) The initial value need not be of this type as long as the
corresponding argument to the constructor is always supplied, but this
will cause a compile-time type warning unless `required-argument'
is used.
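For example (a sketch assuming CMUCL's `required-argument' function, which signals an error if a defaulted argument is not supplied):

```lisp
(defstruct foo
  ;; No NIL default, so no mismatch with the SIMPLE-STRING slot type;
  ;; callers that omit :A get an error from REQUIRED-ARGUMENT instead
  ;; of a compile-time type warning.
  (a (required-argument) :type simple-string))
```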
File: cmu-user.info Node: Weakened Type Checking, Prev: Precise Type Checking, Up: Types in Python
Weakened Type Checking
----------------------
When the value for the `speed' optimization quality is greater than
`safety', and `safety' is not `0', then type checking is weakened to
reduce the speed and space penalty. In structure-intensive code this
can double the speed, yet still catch most type errors. Weakened type
checks provide a level of safety similar to that of "safe" code in other
Common Lisp compilers.
A type check is weakened by changing the check to be for some
convenient supertype of the asserted type. For example,
`(integer 3 17)' is changed to `fixnum',
`(simple-vector 17)' to `simple-vector', and structure
types are changed to `structure'. A complex check like:
(or node hunk (member :foo :bar :baz))
will be omitted entirely (i.e., the check is weakened to `*'.) If a
precise check can be done for no extra cost, then no weakening is done.
Although weakened type checking is similar to type checking done by
other compilers, it is sometimes safer and sometimes less safe.
Weakened checks are done in the same places as precise checks, so all
the preceding discussion about where checking is done still applies.
Weakened checking is sometimes somewhat unsafe because although the
check is weakened, the precise type is still input into type inference.
In some contexts this will result in type inferences not justified by
the weakened check, and hence deletion of some type checks that would be
done by conventional compilers.
For example, if this code was compiled with weakened checks:
(defstruct foo
  (a nil :type simple-string))

(defstruct bar
  (a nil :type single-float))

(defun myfun (x)
  (declare (type bar x))
  (* (bar-a x) 3.0))
and `myfun' was passed a `foo', then no type error would be signalled,
and we would try to multiply a `simple-string' as though it were a float
(with unpredictable results.) This is because the check for `bar' was
weakened to `structure', yet when compiling the call to `bar-a', the
compiler thinks it knows it has a `bar'.
Note that normally even weakened type checks report the precise type in error
messages. For example, if `myfun''s `bar' check is weakened to
`structure', and the argument is false, then the error will be:
Type-error in MYFUN:
NIL is not of type BAR
However, there is some speed and space cost for signalling a precise error, so
the weakened type is reported if the `speed' optimization quality is `3' or
`debug' quality is less than `1':
Type-error in MYFUN:
NIL is not of type STRUCTURE
See ? for further discussion of the `optimize' declaration.
File: cmu-user.info Node: Getting Existing Programs to Run, Prev: Types in Python, Up: The Compiler, Next: Compiler Policy
Getting Existing Programs to Run
================================
Since Python does much more comprehensive type checking than other Lisp
compilers, Python will detect type errors in many programs that have been
debugged using other compilers. These errors are mostly incorrect
declarations, although compile-time type errors can find actual bugs if parts
of the program have never been tested.
Some incorrect declarations can only be detected by run-time type
checking. It is very important to initially compile programs with full
type checks and then test this version. After the checking version has
been tested, then you can consider weakening or eliminating type checks.
This applies even to previously debugged programs. Python does much
more type inference than other Common Lisp compilers, so believing an
incorrect declaration does much more damage.
The most common problem is with variables whose initial value doesn't match the
type declaration. Incorrect initial values will always be flagged by a
compile-time type error, and they are simple to fix once located. Consider
this code fragment:
(prog (foo)
  (declare (fixnum foo))
  (setq foo ...)
  ...)
Here the variable `foo' is given an initial value of false, but is declared
to be a `fixnum'. Even if it is never read, the initial value of a variable
must match the declared type. There are two ways to fix this problem. Change
the declaration:
(prog (foo)
  (declare (type (or fixnum null) foo))
  (setq foo ...)
  ...)
or change the initial value:
(prog ((foo 0))
  (declare (fixnum foo))
  (setq foo ...)
  ...)
It is generally preferable to change to a legal initial value rather
than to weaken the declaration, but sometimes it is simpler to weaken
the declaration than to try to make an initial value of the appropriate
type.
Another declaration problem occasionally encountered is incorrect declarations
on `defmacro' arguments. This usually happens when a function is
converted into a macro. Consider this macro:
(defmacro my-1+ (x)
  (declare (fixnum x))
  `(the fixnum (1+ ,x)))
Although legal and well-defined CMU Common Lisp, the meaning of this definition is
almost certainly not what the writer intended. For example, this call is
illegal:
(my-1+ (+ 4 5))
The call is illegal because the argument to the macro is `(+ 4 5)', which
is a `list', not a `fixnum'. Because of macro semantics, it is hardly ever
useful to declare the types of macro arguments. If you really want to assert
something about the type of the result of evaluating a macro argument, then put
a `the' in the expansion:
(defmacro my-1+ (x)
  `(the fixnum (1+ (the fixnum ,x))))
In this case, it would be stylistically preferable to change this macro
back to a function and declare it inline. Macros have no efficiency
advantage over inline functions when using Python. See ?.
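The rewritten version might look like this (a sketch; the declarations mirror the assertions the macro was trying to make):

```lisp
(declaim (inline my-1+))
(defun my-1+ (x)
  ;; The declaration now applies to the run-time argument value,
  ;; which is what the macro writer presumably intended.
  (declare (fixnum x))
  (the fixnum (1+ x)))
```

With the `inline' proclamation, Python expands calls such as `(my-1+ (+ 4 5))' in place, so nothing is lost relative to the macro version.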
Some more subtle problems are caused by incorrect declarations that can't be
detected at compile time. Consider this code:
(do ((pos 0 (position #\a string :start (1+ pos))))
    ((null pos))
  (declare (fixnum pos))
  ...)
Although `pos' is almost always a `fixnum', it is false at the end of
the loop. If this example is compiled with full type checks (the
default), then running it will signal a type error at the end of the
loop. If compiled without type checks, the program will go into an
infinite loop (or perhaps `position' will complain because `(1+ nil)'
isn't a sensible start.) Why? Because if you compile without type
checks, the compiler just quietly believes the type declaration. Since
`pos' is always a `fixnum', it is never nil, so `(null pos)' is never
true, and the loop exit test is optimized away. Such errors are
sometimes flagged by unreachable code notes (?), but it is still
important to initially compile any system with full type checks, even if
the system works fine when compiled using other compilers.
In this case, the fix is to weaken the type declaration to `(or fixnum
null)'. (5) (*Note Getting Existing Programs to Run-Footnotes::) Note
that there is usually little performance penalty for weakening a
declaration in this way. Any numeric operations in the body can still
assume the variable is a `fixnum', since
false is not a legal numeric argument. Another possible fix would be to say:
(do ((pos 0 (position #\a string :start (1+ pos))))
    ((null pos))
  (let ((pos pos))
    (declare (fixnum pos))
    ...))
This would be preferable in some circumstances, since it would allow a
non-standard representation to be used for the local `pos' variable in the
loop body (see section ?.)
In summary, remember that ALL values that a variable EVER has must be of
the declared type, and that you should test using safe code initially.
File: cmu-user.info Node: Getting Existing Programs to Run-Footnotes, Up: Getting Existing Programs to Run
(5) Actually, this declaration is totally unnecessary in Python, since
it already knows `position' returns a non-negative `fixnum' or
false.
File: cmu-user.info Node: Compiler Policy, Prev: Getting Existing Programs to Run, Up: The Compiler, Next: Open Coding and Inline Expansion
Compiler Policy
===============
The policy is what tells the compiler HOW to compile a program. This is
logically (and often textually) distinct from the program itself. Broad
control of policy is provided by the `optimize' declaration; other
declarations and variables control more specific aspects of compilation.
* Menu:
* The Optimize Declaration::
* The Optimize-Interface Declaration::
File: cmu-user.info Node: The Optimize Declaration, Prev: Compiler Policy, Up: Compiler Policy, Next: The Optimize-Interface Declaration
The Optimize Declaration
------------------------
The `optimize' declaration recognizes six different QUALITIES. The
qualities are conceptually independent aspects of program performance.
In reality, increasing one quality tends to have adverse effects on
other qualities. The compiler compares the relative values of qualities
when it needs to make a trade-off; i.e., if `speed' is greater than
`safety', then improve speed at the cost of safety.
The default for all qualities (except `debug') is `1'. Whenever
qualities are equal, ties are broken according to a broad idea of what a
good default environment is supposed to be. Generally this downplays
`speed', `compile-speed' and `space' in favor of `safety' and `debug'.
Novice and casual users should stick to the default policy. Advanced
users often want to improve speed and memory usage at the cost of safety
and debuggability.
If the value for a quality is `0' or `3', then it may have a special
interpretation. A value of `0' means "totally unimportant", and a `3'
means "ultimately important." These extreme optimization values enable
"heroic" compilation strategies that are not always desirable and
sometimes self-defeating. Specifying more than one quality as `3' is
not desirable, since it doesn't tell the compiler which quality is most
important.
These are the optimization qualities:
`speed'
How fast the program should run. `speed 3' enables some
optimizations that hurt debuggability.
`compilation-speed'
How fast the compiler should run. Note that increasing this above
`safety' weakens type checking.
`space'
How much space the compiled code should take up. Inline expansion
is mostly inhibited when `space' is greater than `speed'. A value
of `0' enables promiscuous inline expansion. Wide use of a `0'
value is not recommended, as it may waste so much space that run
time is slowed. See ? for a discussion of inline expansion.
`debug'
How debuggable the program should be. The quality is treated
differently from the other qualities: each value indicates a
particular level of debugger information; it is not compared with
the other qualities. *Note Compiler Policy Control:: for more
details.
`safety'
How much error checking should be done. If `speed', `space' or
`compilation-speed' is more important than `safety', then type
checking is weakened (*Note Weakened Type Checking::). If `safety'
is `0', then no run time error checking is done. In addition to
suppressing type checks, `0' also suppresses argument count
checking, unbound-symbol checking and array bounds checks.
`extensions:inhibit-warnings'
This is a CMU extension that determines how little (or how much)
diagnostic output should be printed during compilation. This
quality is compared to other qualities to determine whether to
print style notes and warnings concerning those qualities. If
`speed' is greater than `inhibit-warnings', then notes about how to
improve speed will be printed, etc. The default value is `1', so
raising the value for any standard quality above its default
enables notes for that quality. If `inhibit-warnings' is `3', then
all notes and most non-serious warnings are inhibited. This is
useful with `declare' to suppress warnings about unavoidable
problems.
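For example, a tuned-for-speed policy might be proclaimed globally and adjusted locally like this (the particular values are illustrative):

```lisp
;; Global policy: favor speed, retain some safety and debug info.
(declaim (optimize (speed 3) (safety 1) (debug 1)
                   (compilation-speed 0)))

;; Within one function, suppress efficiency notes about an
;; unavoidable generic operation.
(defun quiet-fn (x)
  (declare (optimize (extensions:inhibit-warnings 3)))
  (+ x 1))
```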
File: cmu-user.info Node: The Optimize-Interface Declaration, Prev: The Optimize Declaration, Up: Compiler Policy
The Optimize-Interface Declaration
----------------------------------
The `extensions:optimize-interface' declaration is identical in syntax
to the `optimize' declaration, but it specifies the policy used during
compilation of code the compiler automatically generates to check the
number and type of arguments supplied to a function. It is useful to
specify this policy separately, since even thoroughly debugged functions
are vulnerable to being passed the wrong arguments. The
`optimize-interface' declaration can specify that arguments should be
checked even when the general `optimize' policy is unsafe.
Note that this argument checking is the checking of user-supplied
arguments to any functions defined within the scope of the declaration,
NOT the checking of arguments to Common Lisp primitives that appear in
those definitions.
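For example, a function might be compiled under an unsafe general policy while still validating its incoming arguments (a sketch):

```lisp
(defun fast-but-callable (x)
  (declare (type simple-vector x)
           (optimize (speed 3) (safety 0))
           (extensions:optimize-interface (safety 2)))
  ;; The body is compiled without run-time checks, but the argument
  ;; count and the declared type of X are still verified at entry.
  (svref x 0))
```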
The idea behind this declaration is that it allows the definition of
functions that appear fully safe to other callers, but that do no
internal error checking. Of course, it is possible that arguments may
be invalid in ways other than having incorrect type. Functions compiled
unsafely must still protect themselves against things like user-supplied
array indices that are out of bounds and improper lists. See also the
`:context-declarations' option to `with-compilation-unit' (*Note Compilation
Units::).
File: cmu-user.info Node: Open Coding and Inline Expansion, Prev: Compiler Policy, Up: The Compiler
Open Coding and Inline Expansion
================================
Since CMU Common Lisp forbids the redefinition of standard functions (6)
(*Note Open Coding and Inline Expansion-Footnotes::), the compiler can
have special knowledge of these standard functions embedded in it. This
special knowledge is used in various ways (open coding, inline
expansion, source transformation), but the implications to the user are
basically the same:
* Attempts to redefine standard functions may be frustrated, since
the function may never be called. Although it is technically
illegal to redefine standard functions, users sometimes want to
implicitly redefine these functions when they are debugging using
the `trace' macro. Special-casing of standard functions can be
inhibited using the `notinline' declaration.
* The compiler can have multiple alternate implementations of
standard functions that implement different trade-offs of speed,
space and safety. This selection is based on the compiler policy,
*Note Compiler Policy::.
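For instance, to make calls to `nthcdr' traceable in code compiled after this point, its special-casing can be suppressed:

```lisp
;; Force full function calls to NTHCDR so that (trace nthcdr)
;; actually intercepts them.
(declaim (notinline nthcdr))
```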
When a function call is open coded, inline code whose effect is
equivalent to the function call is substituted for that function call.
When a function call is closed coded, it is usually left as is,
although it might be turned into a call to a different function with
different arguments. As an example, if `nthcdr' were to be open
coded, then
(nthcdr 4 foobar)
might turn into
(cdr (cdr (cdr (cdr foobar))))
or even
(do ((i 0 (1+ i))
     (list foobar (cdr list)))
    ((= i 4) list))
If `nth' is closed coded, then
(nth x l)
might stay the same, or turn into something like:
(car (nthcdr x l))
In general, open coding sacrifices space for speed, but some functions
(such as `car') are so simple that they are always open-coded. Even
when not open-coded, a call to a standard function may be transformed
into a different function call (as in the last example) or compiled as a
static call. Static function call uses a more efficient calling
convention that forbids redefinition.
File: cmu-user.info Node: Open Coding and Inline Expansion-Footnotes, Up: Open Coding and Inline Expansion
(6) See the proposed X3J13 "lisp-symbol-redefinition" cleanup.
File: cmu-user.info Node: Advanced Compiler Use and Efficiency Hints, Prev: The Compiler, Up: Top, Next: UNIX Interface
Advanced Compiler Use and Efficiency Hints
******************************************
By Robert MacLachlan
* Menu:
* Advanced Compiler Introduction::
* More About Types in Python::
* Type Inference::
* Source Optimization::
* Tail Recursion::
* Local Call::
* Block Compilation::
* Inline Expansion::
* Object Representation::
* Numbers::
* General Efficiency Hints::
* Efficiency Notes::
* Profiling::
File: cmu-user.info Node: Advanced Compiler Introduction, Prev: Advanced Compiler Use and Efficiency Hints, Up: Advanced Compiler Use and Efficiency Hints, Next: More About Types in Python
Advanced Compiler Introduction
==============================
In CMU Common Lisp, as in any language on any computer, the path to
efficient code starts with good algorithms and sensible programming
techniques, but to avoid inefficiency pitfalls, you need to know some of
this implementation's quirks and features. This chapter is mostly a
fairly long and detailed overview of what optimizations Python does.
Although there are the usual negative suggestions of inefficient
features to avoid, the main emphasis is on describing the things that
programmers can count on being efficient.
The optimizations described here can have the effect of speeding up
existing programs written in conventional styles, but the potential for
new programming styles that are clearer and less error-prone is at least
as significant. For this reason, several sections end with a discussion
of the implications of these optimizations for programming style.
Writing efficient code that works is a complex and prolonged process. It is
important not to get so involved in the pursuit of efficiency that you lose
sight of what the original problem demands. Remember that:
* The program should be correct -- it doesn't matter how quickly you
get the wrong answer.
* Both the programmer and the user will make errors, so the program
must be robust -- it must detect errors in a way that allows easy
correction.
* A small portion of the program will consume most of the resources,
with the bulk of the code being virtually irrelevant to efficiency
considerations. Even experienced programmers familiar with the
problem area cannot reliably predict where these "hot spots" will
be.
The best way to get efficient code that is still worth using is to separate
coding from tuning. During coding, you should:
* Use a coding style that aids correctness and robustness without
being incompatible with efficiency.
* Choose appropriate data structures that allow efficient algorithms
and object representations (?). Try to make interfaces abstract
enough so that you can change to a different representation if
profiling reveals a need.
* Whenever you make an assumption about a function argument or global
data structure, add consistency assertions, either with type
declarations or explicit uses of `assert', `ecase', etc.
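For example, assumptions can be recorded with declarations and explicit assertions like this (the names `*node-limit*', `fast-walk' and `safe-walk' are purely illustrative):

```lisp
(defun process-node (node mode)
  (declare (type (integer 0) node))    ; type declaration as assertion
  (assert (< node *node-limit*))       ; explicit consistency check
  (ecase mode                          ; signals an error on a bad key
    (:fast (fast-walk node))
    (:safe (safe-walk node))))
```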
During tuning, you should:
* Identify the hot spots in the program through profiling (section
?.)
* Identify inefficient constructs in the hot spot with efficiency
notes, more profiling, or manual inspection of the source. See
sections ? and ?.
* Add declarations and consider the application of optimizations.
See sections ?, ? and ?.
* If all else fails, consider algorithm or data structure changes.
If you did a good job coding, changes will be easy to introduce.
File: cmu-user.info Node: More About Types in Python, Prev: Advanced Compiler Introduction, Up: Advanced Compiler Use and Efficiency Hints, Next: Type Inference
More About Types in Python
==========================
This section goes into more detail describing what types and declarations are
recognized by Python. The area where Python differs most radically from
previous Common Lisp compilers is in its support for types:
* Precise type checking helps to find bugs at run time.
* Compile-time type checking helps to find bugs at compile time.
* Type inference minimizes the need for generic operations, and also
increases the efficiency of run time type checking and the
effectiveness of compile time type checking.
* Support for detailed types provides a wealth of opportunity for
operation-specific type inference and optimization.
* Menu:
* More Types Meaningful::
* Canonicalization::
* Member Types::
* Union Types::
* The Empty Type::
* Function Types::
* The Values Declaration::
* Structure Types::
* The Freeze-Type Declaration::
* Type Restrictions::
* Type Style Recommendations::
File: cmu-user.info Node: More Types Meaningful, Prev: More About Types in Python, Up: More About Types in Python, Next: Canonicalization
More Types Meaningful
---------------------
CMU Common Lisp has a very powerful type system, but conventional Common
Lisp implementations typically only recognize the small set of types
special in that implementation. In these systems, there is an
unfortunate paradox: a declaration for a relatively general type like
`fixnum' will be recognized by the compiler, but a highly specific
declaration such as `(integer 3 17)' is totally ignored.
This is obviously a problem, since the user has to know how to specify
the type of an object in the way the compiler wants it. A very minimal
(but rarely satisfied) criterion for type system support is that it be
no worse to make a specific declaration than to make a general one.
Python goes beyond this by exploiting a number of advantages obtained
from detailed type information.
Using more restrictive types in declarations allows the compiler to do
better type inference and more compile-time type checking. Also, when
type declarations are considered to be consistency assertions that
should be verified (conditional on policy), then complex types are
useful for making more detailed assertions.
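For example, a detailed declaration documents an assumption that a general one would not, and safe code verifies it (a sketch):

```lisp
(defun day-name (day)
  ;; The precise bound both aids type inference (the SVREF index is
  ;; known to be in range) and is checked under a safe policy.
  (declare (type (integer 0 6) day))
  (svref #("Mon" "Tue" "Wed" "Thu" "Fri" "Sat" "Sun") day))
```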
Python "understands" the list-style `or', `member', `function', array and
number type specifiers. Understanding means that:
* If the type contains more information than is used in a particular
context, then the extra information is simply ignored, rather than
derailing type inference.
* In many contexts, the extra information from these type specifiers
is used to good effect. In particular, type checking in `Python'
is PRECISE, so these complex types can be used in declarations to
make interesting assertions about functions and data structures
(*Note Precise Type Checking::.) More specific declarations also
aid type inference and reduce the cost for type checking.
For related information, see ? for numeric types and section ? for array
types.
File: cmu-user.info Node: Canonicalization, Prev: More Types Meaningful, Up: More About Types in Python, Next: Member Types
Canonicalization
----------------
When given a type specifier, Python will often rewrite it into a different
(but equivalent) type. This is the mechanism that Python uses for detecting
type equivalence. For example, in Python's canonical representation, these
types are equivalent:
(or list (member :end)) == (or cons (member nil :end))
This has two implications for the user:
* The standard symbol type specifiers for `atom', `null', `fixnum',
etc., are in no way magical. The null type is actually defined to
be `(member nil)', list is `(or cons null)', and fixnum is
`(signed-byte 30)'.
* When the compiler prints out a type, it may not look like the type
specifier that originally appeared in the program. This is
generally not a problem, but it must be taken into consideration
when reading compiler error messages.
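The equivalences described above can be observed directly with the standard `subtypep' function:

```lisp
(subtypep 'null '(member nil))    ; => T, T
(subtypep '(member nil) 'null)    ; => T, T
(subtypep 'list '(or cons null))  ; => T, T
```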
File: cmu-user.info Node: Member Types, Prev: Canonicalization, Up: More About Types in Python, Next: Union Types
Member Types
------------
The member type specifier can be used to represent
"symbolic" values, analogous to the enumerated types of Pascal. For
example, the second value of `find-symbol' has this type:
(member :internal :external :inherited nil)
Member types are very useful for expressing consistency constraints on data